Linear Classification and Selective Sampling Under Low Noise Conditions
نویسندگان
چکیده
We provide a new analysis of an efficient margin-based algorithm for selective sampling in classification problems. Using the so-called Tsybakov low noise condition to parametrize the instance distribution, we show bounds on the convergence rate to the Bayes risk of both the fully supervised and the selective sampling versions of the basic algorithm. Our analysis reveals that, excluding logarithmic factors, the average risk of the selective sampler converges to the Bayes risk at rate N where N denotes the number of queried labels, and α > 0 is the exponent in the low noise condition. For all α > √ 3− 1 ≈ 0.73 this convergence rate is asymptotically faster than the rate N achieved by the fully supervised version of the same classifier, which queries all labels, and for α→ ∞ the two rates exhibit an exponential gap. Experiments on textual data reveal that simple variants of the proposed selective sampler perform much better than popular and similarly efficient competitors.
منابع مشابه
Adaptive Sampling Under Low Noise Conditions
We survey some recent results on efficient margin-based algorithms for adaptive sampling in binary classification tasks. Using the so-called Mammen-Tsybakov low noise condition to parametrize the distribution of covariates, and assuming linear label noise, we state bounds on the convergence rate of the adaptive sampler to the Bayes risk. These bounds show that, excluding logarithmic factors, th...
متن کاملNoise-Adaptive Margin-Based Active Learning and Lower Bounds under Tsybakov Noise Condition
We present a simple noise-robust margin-based active learning algorithm to find homogeneous (passing the origin) linear separators and analyze its error convergence when labels are corrupted by noise. We show that when the imposed noise satisfies the Tsybakov low noise condition (Mammen, Tsybakov, and others 1999; Tsybakov 2004) the algorithm is able to adapt to unknown level of noise and achie...
متن کاملLearning Probabilistic Linear-Threshold Classifiers via Selective Sampling
In this paper we investigate selective sampling, a learning model where the learner observes a sequence of i.i.d. unlabeled instances each time deciding whether to query the label of the current instance. We assume that labels are binary and stochastically related to instances via a linear probabilistic function whose coefficients are arbitrary and unknown. We then introduce a new selective sam...
متن کاملEnhancing Particle Filters Using Local Likelihood Sampling
Particle filters provide a means to track the state of an object even when the dynamics and the observations are non-linear/nonGaussian. However, they can be very inefficient when the observation noise is low as compared to the system noise, as it is often the case in visual tracking applications. In this paper we propose a new two-stage sampling procedure to boost the performance of particle f...
متن کاملThe Relation between Hearing Loss and Smoking among Workers Exposed to Noise, Using Linear Mixed Models
Introduction: Noise is one of the most common and harmful physical factors in the working environment and has physical and psychological effects on individuals. In this study, the audiometry results of industrial workers were modeled and the effect of noise and other factors on hearing loss was examined. Materials and Methods: ...
متن کامل